435 research outputs found

    A method for finding putative causes of gene expression variation

    Get PDF
    The majority of microarray studies evaluate gene ex- pression differences between various specimens or con- ditions. However, the causes of this variability often re- main unknown. Our aim is to identify underlying causes of these patterns, a process that would eventually enable a mechanistic understanding of the deregulation of gene expression in cancer. The procedure consists of three phases: pre-processing, data integration and statistical analysis. We have applied the strategy to identify genes that are overexpressed due to amplification in breast cancer. The data were obtained from 14 breast cancer cell lines, which were subjected to cDNA microarray based copy number and expression experiments. The re- sult of the analysis was a list that consisted of 92 genes. This set includes several genes that are known to be both overexpressed and amplified in breast cancer. The com- plete study was published in Journal of the Franklin In- stitute 2004, and in this paper we focus on the main issues of the study

    Classification of unknown primary tumors with a data-driven method based on a large microarray reference database

    Get PDF
    We present a new method to analyze cancer of unknown primary origin (CUP) samples. Our method achieves good results with classification accuracy (88% leave-one-out cross validation for primary tumors from 56 categories, 78% for CUP samples), and can also be used to study CUP samples on a gene-by-gene basis. It is not tied to any a priori defined gene set as many previous methods, and is adaptable to emerging new information

    Clinical relevance of integrin alpha 4 in gastrointestinal stromal tumours

    Get PDF
    The molecular mechanisms for the dissemination and metastasis of gastrointestinal stromal tumours (GIST) are incompletely understood. The purpose of the study was to investigate the clinical relevance of integrin alpha 4 (ITGA4) expression in GIST. GIST transcriptomes were first compared with transcriptomes of other types of cancer and histologically normal gastrointestinal tract tissue in the MediSapiens in silico database. ITGA4 was identified as an unusually highly expressed gene in GIST. Therefore, the effects of ITGA4 knock-down and selective integrin alpha 4 beta 1 (VLA-4) inhibitors on tumour cell proliferation and invasion were investigated in three GIST cell lines. In addition, the prognostic role of ITGA4 expression in cancer cells was investigated in a series of 147 GIST patients with immunohistochemistry. Inhibition of ITGA4-related signalling decreased GIST cell invasion in all investigated GIST cell lines. ITGA4 protein was expressed in 62 (42.2%) of the 147 GISTs examined, and expression was significantly associated with distant metastases during the course of the disease and several adverse prognostic features. Patients whose GIST expressed strongly ITGA4 had unfavourable GIST-specific survival and overall survival compared to patients with low or no ITGA4 expression. Taken together, ITGA4 is an important integrin in the molecular pathogenesis of GIST and may influence their clinical behaviour.Peer reviewe

    The gene expression landscape of breast cancer is shaped by tumor protein p53 status and epithelial-mesenchymal transition

    Get PDF
    Introduction: Gene expression data derived from clinical cancer specimens provide an opportunity to characterize cancer-specific transcriptional programs. Here, we present an analysis delineating a correlation-based gene expression landscape of breast cancer that identifies modules with strong associations to breast cancer-specific and general tumor biology. Methods: Modules of highly connected genes were extracted from a gene co-expression network that was constructed based on Pearson correlation, and module activities were then calculated using a pathway activity score. Functional annotations of modules were experimentally validated with an siRNA cell spot microarray system using the KPL-4 breast cancer cell line, and by using gene expression data from functional studies. Modules were derived using gene expression data representing 1,608 breast cancer samples and validated in data sets representing 971 independent breast cancer samples as well as 1,231 samples from other cancer forms. Results: The initial co-expression network analysis resulted in the characterization of eight tightly regulated gene modules. Cell cycle genes were divided into two transcriptional programs, and experimental validation using an siRNA screen showed different functional roles for these programs during proliferation. The division of the two programs was found to act as a marker for tumor protein p53 (TP53) gene status in luminal breast cancer, with the two programs being separated only in luminal tumors with functional p53 (encoded by TP53). Moreover, a module containing fibroblast and stroma-related genes was highly expressed in fibroblasts, but was also up-regulated by overexpression of epithelial-mesenchymal transition factors such as transforming growth factor beta 1 (TGF-beta1) and Snail in immortalized human mammary epithelial cells. Strikingly, the stroma transcriptional program related to less malignant tumors for luminal disease and aggressive lymph node positive disease among basal-like tumors. Conclusions: We have derived a robust gene expression landscape of breast cancer that reflects known subtypes as well as heterogeneity within these subtypes. By applying the modules to TP53-mutated samples we shed light on the biological consequences of non-functional p53 in otherwise low-proliferating luminal breast cancer. Furthermore, as in the case of the stroma module, we show that the biological and clinical interpretation of a set of co-regulated genes is subtype-dependent

    The impact of RNA sequence library construction protocols on transcriptomic profiling of leukemia

    Get PDF
    Background: RNA sequencing (RNA-seq) has become an indispensable tool to identify disease associated transcriptional profiles and determine the molecular underpinnings of diseases. However, the broad adaptation of the methodology into the clinic is still hampered by inconsistent results from different RNA-seq protocols and involves further evaluation of its analytical reliability using patient samples. Here, we applied two commonly used RNA-seq library preparation protocols to samples from acute leukemia patients to understand how poly-A-tailed mRNA selection (PA) and ribo-depletion (RD) based RNA-seq library preparation protocols affect gene fusion detection, variant calling, and gene expression profiling. Results: Overall, the protocols produced similar results with consistent outcomes. Nevertheless, the PA protocol was more efficient in quantifying expression of leukemia marker genes and showed better performance in the expression-based classification of leukemia. Independent qRT-PCR experiments verified that the PA protocol better represented total RNA compared to the RD protocol. In contrast, the RD protocol detected a higher number of non-coding RNA features and had better alignment efficiency. The RD protocol also recovered more known fusion-gene events, although variability was seen in fusion gene predictions. Conclusion: The overall findings provide a framework for the use of RNA-seq in a precision medicine setting with limited number of samples and suggest that selection of the library preparation protocol should be based on the objectives of the analysis.Peer reviewe

    High-throughput cell-based compound screen identifies pinosylvin methyl ether and tanshinone IIA as inhibitors of castration-resistant prostate cancer

    Get PDF
    Current treatment options for castration-resistant prostate cancer (CRPC) are limited. In this study, a high-throughput screen of 4910 drugs and drug-like molecules was performed to identify antiproliferative compounds in androgen ablated prostate cancer cells. The effect of compounds on cell viability was compared in androgen ablated LNCaP prostate cancer cells and in LNCaP cells grown in presence of androgens as well as in two non-malignant prostate epithelial cells (RWPE-1 and EP156T). Validation experiments of cancer specific anti-proliferative compounds indicated pinosylvin methyl ether (PSME) and tanshinone IIA as potent inhibitors of androgen ablated LNCaP cell proliferation. PSME is a stilbene compound with no previously described antineoplastic activity whereas tanshinone IIA is currently used in cardiovascular disorders and proposed as a cancer drug. To gain insights into growth inhibitory mechanisms in CRPC, genome-wide gene expression analysis was performed in PSME- and tanshinone IIA-exposed cells. Both compounds altered the expression of genes involved in cell cycle and steroid and cholesterol biosynthesis in androgen ablated LNCaP cells. Decrease in androgen signalling was confirmed by reduced expression of androgen receptor and prostate specific antigen in PSME- or tanshinone IIA-exposed cells. Taken together, this systematic screen identified a novel anti-proliferative agent, PSME, for CRPC. Moreover, our screen confirmed tanshinone IIA as well as several other compounds as potential prostate cancer growth inhibitors also in androgen ablated prostate cancer cells. These results provide valuable starting points for preclinical and clinical studies for CRPC treatment

    Alignment of gene expression profiles from test samples against a reference database: New method for context-specific interpretation of microarray data

    Get PDF
    Alignment of gene expression profiles from test samples against a reference database: New method for context-specific interpretation of microarray data Kilpinen, Sami K Ojala, Kalle A Kallioniemi, Olli P England BioData mining BioData Min. 2011 Mar 31;4:5. engBACKGROUND: Gene expression microarray data have been organized and made available as public databases, but the utilization of such highly heterogeneous reference datasets in the interpretation of data from individual test samples is not as developed as e.g. in the field of nucleotide sequence comparisons. We have created a rapid and powerful approach for the alignment of microarray gene expression profiles (AGEP) from test samples with those contained in a large annotated public reference database and demonstrate here how this can facilitate interpretation of microarray data from individual samples. METHODS: AGEP is based on the calculation of kernel density distributions for the levels of expression of each gene in each reference tissue type and provides a quantitation of the similarity between the test sample and the reference tissue types as well as the identity of the typical and atypical genes in each comparison. As a reference database, we used 1654 samples from 44 normal tissues (extracted from the Genesapiens database). RESULTS: Using leave-one-out validation, AGEP correctly defined the tissue of origin for 1521 (93.6%) of all the 1654 samples in the original database. Independent validation of 195 external normal tissue samples resulted in 87% accuracy for the exact tissue type and 97% accuracy with related tissue types. AGEP analysis of 10 Duchenne muscular dystrophy (DMD) samples provided quantitative description of the key pathogenetic events, such as the extent of inflammation, in individual samples and pinpointed tissue-specific genes whose expression changed (SAMD4A) in DMD. AGEP analysis of microarray data from adipocytic differentiation of mesenchymal stem cells and from normal myeloid cell types and leukemias provided quantitative characterization of the transcriptomic changes during normal and abnormal cell differentiation. CONCLUSIONS: The AGEP method is a widely applicable method for the rapid comprehensive interpretation of microarray data, as proven here by the definition of tissue- and disease-specific changes in gene expression as well as during cellular differentiation. The capability to quantitatively compare data from individual samples against a large-scale annotated reference database represents a widely applicable paradigm for the analysis of all types of high-throughput data. AGEP enables systematic and quantitative comparison of gene expression data from test samples against a comprehensive collection of different cell/tissue types previously studied by the entire research community.Peer reviewe
    corecore